Picture for Yaodong Yang

Yaodong Yang

SPADE-Bench: Evaluating Spontaneous Strategic Deception in Agents via Plan-Action Divergence

Add code
Jun 01, 2026
Viaarxiv icon

SafeMCP: Proactive Power Regulation for LLM Agent Defense via Environment-Grounded Look-Ahead Reasoning

Add code
Jun 01, 2026
Viaarxiv icon

MiraBench: Evaluating Action-Conditioned Reliability in Robotic World Models

Add code
May 28, 2026
Viaarxiv icon

GuardAD: Safeguarding Autonomous Driving MLLMs via Markovian Safety Logic

Add code
May 11, 2026
Viaarxiv icon

RedVLA: Physical Red Teaming for Vision-Language-Action Models

Add code
Apr 24, 2026
Viaarxiv icon

Reading Between the Pixels: An Inscriptive Jailbreak Attack on Text-to-Image Models

Add code
Apr 07, 2026
Viaarxiv icon

Policy Improvement Reinforcement Learning

Add code
Apr 01, 2026
Viaarxiv icon

Stable Reasoning, Unstable Responses: Mitigating LLM Deception via Stability Asymmetry

Add code
Mar 27, 2026
Viaarxiv icon

System Design for Maintaining Internal State Consistency in Long-Horizon Robotic Tabletop Games

Add code
Mar 26, 2026
Viaarxiv icon

ShuttleEnv: An Interactive Data-Driven RL Environment for Badminton Strategy Modeling

Add code
Mar 18, 2026
Viaarxiv icon